Yunpu Ma

ProactiveLLM: Learning Active Interaction for Streaming Large Language Models

May 30, 2026
Via arXiv

EchoRL: Reinforcement Learning via Rollout Echoing

May 29, 2026
Via arXiv

Memory-R2: Fair Credit Assignment for Long-Horizon Memory-Augmented LLM Agents

May 20, 2026
Via arXiv

Mem$^2$Evolve: Towards Self-Evolving Agents via Co-Evolutionary Capability Expansion and Experience Distillation

Apr 13, 2026
Via arXiv

Routing-Free Mixture-of-Experts

Apr 01, 2026
Via arXiv

Think-as-You-See: Streaming Chain-of-Thought Reasoning for Large Vision-Language Models

Mar 03, 2026
Via arXiv

HiDrop: Hierarchical Vision Token Reduction in MLLMs via Late Injection, Concave Pyramid Pruning, and Early Exit

Feb 27, 2026
Via arXiv

UTPTrack: Towards Simple and Unified Token Pruning for Visual Tracking

Feb 27, 2026
Via arXiv

Rethinking the Role of LLMs in Time Series Forecasting

Feb 16, 2026
Via arXiv

On-Policy Supervised Fine-Tuning for Efficient Reasoning

Feb 13, 2026
Via arXiv